An empirical analysis of word error rate and keyword error rate

Authors

  • Youngja Park
  • Siddharth Patwardhan
  • Karthik Visweswariah
  • Stephen C. Gates
Abstract

This paper studies the relationship between word error rate (WER) and keyword error rate (KER) in speech transcripts and their effect on the performance of speech analytics applications. Automatic speech recognition (ASR) systems are increasingly used as input for speech analytics, which raises the question of whether WER or KER is the more suitable performance metric for calibrating the ASR system. ASR systems are typically evaluated in terms of WER. Many speech analytics applications, however, rely on identifying keywords in the transcripts, so their performance can be expected to be more sensitive to keyword errors than to regular word errors. To study this question, we conduct a case study using an experimental data set comprising 100 calls to a contact center. We first automatically extract domain-specific words from the manual transcription and use this set of words to calculate keyword error rates in the following experiments. We then generate call transcripts with the IBM Attila speech recognition system, using different training for each repetition to generate transcripts with a range of word error rates. The transcripts are then processed with two speech analytics applications, call section segmentation and topic categorization. The results show similar WER and KER in high-accuracy transcripts, but KER increases more rapidly than WER as the accuracy of the transcription deteriorates. Neither speech analytics application showed significant sensitivity to the increase in KER for low-accuracy transcripts. Thus this case study did not identify a significant difference between using WER and KER.
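To make the two metrics in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes WER is the word-level edit distance (substitutions, deletions, insertions) divided by the reference length, and it assumes a simple count-based KER: missed plus falsely recognized keyword occurrences, divided by the number of keyword occurrences in the reference. The abstract does not state the exact KER formula, so that definition, the function names, and the example sentences are all hypothetical.

```python
from collections import Counter


def edit_distance(ref, hyp):
    """Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def word_error_rate(ref, hyp):
    """WER = edit distance / number of reference words."""
    if not ref:
        raise ValueError("reference transcript is empty")
    return edit_distance(ref, hyp) / len(ref)


def keyword_error_rate(ref, hyp, keywords):
    """Assumed KER = (missed + false keyword occurrences) / reference keyword occurrences."""
    ref_counts = Counter(w for w in ref if w in keywords)
    hyp_counts = Counter(w for w in hyp if w in keywords)
    total = sum(ref_counts.values())
    if total == 0:
        raise ValueError("reference contains no keywords")
    missed = sum(max(0, ref_counts[k] - hyp_counts[k]) for k in keywords)
    false = sum(max(0, hyp_counts[k] - ref_counts[k]) for k in keywords)
    return (missed + false) / total


# Hypothetical manual transcript, ASR output, and domain keyword set.
reference = "i want to cancel my credit card".split()
hypothesis = "i want to cancel my credit cart".split()
domain_keywords = {"cancel", "credit", "card"}

wer = word_error_rate(reference, hypothesis)
ker = keyword_error_rate(reference, hypothesis, domain_keywords)
print(f"WER = {wer:.3f}, KER = {ker:.3f}")
```

In this toy example a single misrecognized word costs one error out of seven words for WER, but one error out of three keywords for KER, which illustrates how the two rates can diverge when errors fall on domain-specific words.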


Similar resources

Modified signed log-likelihood test for the coefficient of variation of an inverse Gaussian population

In this paper, we consider the problem of two-sided hypothesis testing for the coefficient of variation of an inverse Gaussian population. The approach used here is the modified signed log-likelihood ratio (MSLR) method, which is a modification of the traditional signed log-likelihood ratio test. Previous works show that this proposed method has third-order accuracy whereas the traditi...


Effect of light color temperature on selective attention, error rate and reaction time

Background and aims: In humans, the limit on reaction time is associated with several factors. It includes the time it takes to stimulate the sensory organ, transmit the stimulus to the brain, perceive it, and make a decision; consequently, the command resu...


Keyword-based Discriminative Training of Acoustic Models

In this paper, we investigate a new discriminative training technique which focuses on optimizing a keyword error rate, rather than the error rate on all words. We hypothesize that improvements in keyword error rate correlate with improvements in understanding error rates. Keyword-based discriminative training is accomplished by modifying a standard minimum classification error (MCE) training a...


An Empirical Analysis of China’s International Reserves Demand Function

The study aims to estimate an international reserves demand model for China using economic growth, propensity to import, real effective exchange rate and trade openness variables for the quarterly period from 1985Q1 to 2014Q4. The bounds testing technique to cointegration is used to test for a long run relationship, while the autoregressive distributed lag approach is used to estimate short...


Evaluation of the relationship between the uses of safety procedures in the rate of human error in Yazd Combined Cycle Power Plant

Introduction: About 60 to 90 percent of industrial accidents are caused by human error. This study aimed to assess the effectiveness of safety procedures in reducing human error among Yazd Combined Cycle Power Plant employees. Materials and Methods: The present study is a quasi-experimental intervention conducted to measure the human error of 121 employees of Yazd Combined...




Publication date: 2008